Combining Dependency and Constituent-based Resources for Structure Disambiguation

نویسندگان

  • SOFÍA N. GALICIA-HARO
  • ALEXANDER GELBUKH
  • IGOR A. BOLSHAKOV
  • Juan de Dios
چکیده

Unrestricted text analysis requires an accurate syntactic analysis but structural ambiguity is one of the most difficult problems to resolve. Researchers have tried different approaches to obtain the correct syntactic structure from analyzed sentences but not successful results have been obtained. Two different approaches have traditionally applied to syntactic analysis: constituent grammars and dependency grammars. We propose a model for syntactic analysis and disambiguation combining lexical dependencies and semantic proximity. Lexical dependencies are applied by means of a government pattern dictionary following the dependency approach. The semantic proximity is introduced by means of semantic closeness among constituents. Examples are given to illustrate method’s contributions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تبدیل خودکار درخت‌بانک وابستگی فارسی به درخت‌بانک سازه‌ای

There are two major types of treebanks: dependency-based and constituency-based. Both of them have applications in natural language processing and computational linguistics. Several dependency treebanks have been developed for Persian. However, there is no available big size constituency treebank for this language. In this paper, we aim to propose an algorithm for automatic conversion of a depe...

متن کامل

Combining resources for MWE-token classification

We study the task of automatically disambiguating word combinations such as jump the gun which are ambiguous between a literal and MWE interpretation, focusing on the utility of type-level features from an MWE lexicon for the disambiguation task. To this end we combine gold-standard idiomaticity of tokens in the OpenMWE corpus with MWE-type-level information drawn from the recently-published JD...

متن کامل

Combining Dependency and Constituent-based Syntactic Information for Anaphoricity Determination in Coreference Resolution

This paper systematically explores the effectiveness of dependency and constituent-based syntactic information for anaphoricity determination. In particular, this paper proposes two ways to combine dependency and constituent-based syntactic information to explore their complementary advantage. One is a dependency-driven constituent-based structured representation, and the other uses a composite...

متن کامل

Non-constituent coordination and other coordinative constructions as Dependency Graphs

This paper proposes a new dependency-based analysis of coordination that generalizes over existing analyses by combining symmetrical and asymmetrical analyses of coordination into a DAG structure. The new joint structure is shown to be theoretically grounded in the notion of connections between words just as the formal definition of other types of dependencies. Beside formalizations of shared d...

متن کامل

Automatic Semantic Role Labeling using Selectional Preferences with Very Large Corpora

OF PhD THESIS Automatic Semantic Role Labeling using Selectional Preferences with Very Large Corpora Determinación Automática de Roles Semánticos usando Preferencias de Selección sobre Corpus muy Grandes Graduated: Hiram Calvo Center for Research in Computing (CIC) National Polytechnic Institute (IPN) Mexico City, Mexico, 07738 [email protected] [email protected] Graduated on June 19th, 2006...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006